Automatic Summarization from Multiple Documents (Extended Abstract)
نویسنده
چکیده
Since the late 50’s and Luhn [Luh58] the information community has expressed its interest in summarizing texts. The domains of application of such methodologies are countless, ranging from news summarization [WL03, BM05, ROWBG05] to scientific article summarization [TM02] and meeting summarization [NPDP05, ELH03]. Summarization has been defined as a reductive transformation of a given set of texts, usually described as a three-step process: selection of salient portions of text, aggregation of the information over various selected portions, abstraction of this information to a more general level, and finally presentation of the final summary text [MB99, Jon99]. The summarization community nowadays includes a significant number of scientists with increasing interest in the multi-document aspect of summarization. Major issues towards multi-document summarization that have been identified by the researchers include the detection of (possibly query-related) salient information in texts, the preservation of coherence and linguistic quality in summarized text, the control of redundancy in the summaries, and the language independence of summarization methods. Up to date, many summarization systems have been developed and presented, especially within such endeavours as the Document Understanding Conferences (DUC) and Text Analysis Conferences (TAC). The DUC and TAC have strengthened the summarization community and have helped in identifying
منابع مشابه
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملBiogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملAutomatic Summarization from Multiple Documents
This work reports on research conducted on the domain of multi-document summarization using background knowledge. The research focuses on summary evaluation and the implementation of a set of generic use tools for NLP tasks and especially for automatic summarization. Within this work we formalize the n-gram graph representation and its use in NLP tasks. We present the use of n-gram graphs for t...
متن کاملImpact of Document Structure on Hierarchical Summarization
Hierarchical summarization technique summarizes a large document based on the hierarchical structure and salient features of the document. Previous study has shown that hierarchical summarization is a promising technique which can effectively extract the most important information from the source document. Hierarchical summarization has been extended to summarization of multiple documents. Thre...
متن کاملLiterature Review on Automatic Text Summarization: Single and Multiple Summarizations
The online information available on world wide web is in enormous amount. Search engines like Google, Yahoo were developed to retrieve information from the databases. But actual results were not obtained as the electronic information is increasing day by day. Thus automatic summarization came into demand. Automatic summarization gathers several documents as input and provides the shorter summar...
متن کامل